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For the A -calculus with letrec we develop an optimisation, which is based on the contraction of a 
certain class of 'future' (also: virtual) redexes. 

In the implementation of functional programming languages it is common practice to perform 
j3 -reductions at compile time whenever possible in order to produce code that requires fewer re- 
ductions at run-time. This is, however, in principle limited to redexes and created redexes that are 
'visible' (in the sense that they can be contracted without the need for unsharing), and cannot gener- 
ally be extended to redexes that are concealed by sharing constructs such as letrec. In the case of 
recursion, concealed redexes become visible only after unwindings during evaluation, and then have 
to be contracted time and again. 

We observe that in some cases such redexes exhibit a certain form of repetitive behaviour at 
run time. We describe an analysis for identifying binders that give rise to such repetitive reduction 
patterns, and eliminate them by a sort of predictive contraction. Thereby these binders are lifted 
out of recursive positions or eliminated altogether, as a result alleviating the amount of j3 -reductions 
required for each recursive iteration. 

Both our analysis and simplification are suitable to be integrated into existing compilers for 
functional programming languages as an additional optimisation phase. With this work we hope to 
contribute to increasing the efficiency of executing programs written in such languages. 

In this extended abstract we report on work in progress carried out within the framework of the NWO 
project Realising Optimal Sharing. Instead of discussing optimal reduction in the A -calculus, however, 
here we are concerned with a static analysis of A/ e , rec -terms which aims at removing j3 -redexes that are 
concealed by recursion constructs and cause cyclic migration of arguments during evaluation. We have 
to stress that our research on this particular topic is still in an early phase. 

1 Introduction 

In this work we study terms in Xi etrec , i.e. in A-calculus with an explicit letrec-construct for recursive 
definitions, that exhibit a form of repetitive reduction pattern when evaluated. We try to identify a class of 
such terms for which this behaviour can be avoided by a transformation into a term with, in some sense, 
the same semantic denotation. Even though the presented optimisation can be described directly for 
A/ efrec -terms, and hence is applicable in all functional languages of which Xi etrec is a meaningful abstrac- 
tion, we will use Haskell to denote examples of such terms and their optimised equivalents. Additionally, 
we depict terms as A -graphs with explicit sharing-nodes {multiplexers) as used, for example, in 0. 
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A function well-known to Haskell programmers is the repeat function that generates an infinite, 
constant stream of the supplied argument. A definition is easily found, namely: 

repeat x = x : repeat x 

An experienced Haskell programmer, however, would spot a 'space leak', which refers to an 0(n) mem- 
ory consumption for generating n stream elements while 0{ 1 ) is possible, due to lazy evaluation. There- 
fore in the Haskell standard libraries that function is defined as: 

repeat x = let xs = x : xs in xs 

The exact reasons for this difference in efficiency involve the characteristics of the deployed Haskell 
compiler and run-time system. A more direct and theoretical explanation can be attempted within the 
formal framework of X\ etrec : The improved variant of repeat does not require any j3 -reductions to produce 
further stream elements. That becomes apparent by the A -graphs and their infinite unfoldings (Fig. [T}. 
We use a rewriting relation arrow indexed by a triangle (-> v ) to mark unfolding, regardless of whether 
sharing is expressed by a multiplexer or a letrec-binding. 



Figure 1 : Term graph representation of the two repeat implementations and their unfoldings 

From a software engineering perspective it is unsatisfactory that the programmer has to recognise 
and mitigate such cases. One might even consider the unoptimised version superior with respect to 
code clarity. Therefore we propose an analysis and transformation method to automate the optimisation, 
which then can be integrated into the compiler pipeline of existing functional language implementations. 
In the following sections we will work with simple examples to develop this method and successively 
generalise it for wider applicability. 

2 Preliminaries 

The method we describe applies to the A/ e?rec -calculus, which is a higher-order rewrite system. Still, in 
this work-in-progress report we primarily intend to motivate our research and outline the approaches 
developed so far. Hence, for the moment we give a largely informal description, and resort to first-order 
formulations, A -graphs, or Haskell, whichever seems more suited. 



Xx 




x 
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Definition 1 (First-order representation of Xi etrec ) Let V be a set of variables. Then a Xi etrec -term is 
defined as follows: 



(term) 



(definitions) Defs 



XV.T 
T T 
V 

letrec Defs in T 

v\-T ... v n -T 

vi , . . . , v„ 6 V all distinct 



(abstraction) 

(application) 

(variable) 

(letrec) 

(equations) 



But ultimately only a higher-order formulation can be formally satisfactory, thus we propose the 
following representation as a higher-order rewrite system (HRS) iPTBl for Xi etrec . 

Definition 2 (Higher-order representation of Xi etrec ) Let Var be a set of variables, and BTypes a set of 
base types that induce the set Types of simple types. The terms of Xj etrec are simply-typed higher-order 
terms over the HRS-signature that for all neN, and all types Zq, Ti, . . . , T„ e Types contains a symbol 
let„_in of type: 

let„_in : (ti x . . . x T„ -> To x Ti . . . x X n ) -> Tq 



Product types are only used for better readability. We will use the symbols t, s, u for terms in A, 



'letrec- 



Based on this notion of terms, the A/ e , rec -calculus consists of the rewrite relations: j3-reduction, 
17 -reduction, and letrec-unfolding. Additionally, we use the concept of generalised j3 -reduction ifTTTl . 

For example, letrec-unfolding on the informal A/ efr£ . c -terms according to the grammar above could 
be described by the following rewrite rules: 



let/! =si(f ),..., f n = s„(f) in t 



(if fi,...,f n not free in t) 



let/i =si (/),.. ., fn=s n (f) in t(f) ^ v t(let Defs in s 1 (/),..., let Defs in s n (f)) 

Defs 

which, if translated into HRS-rules (using the signature defined in Def.[2]), can take the following form: 

let„_in A/i... f n .(Y,Zi (/),..., Z n (f)) ^ v Y 

let„ _in Xf x ... /„ • (Y (f), Zj (/),..., Z„ (/) ) - v 

F(let„_in A/i .../„. (Zi (/), Zi (/),..., Z„ (/))).. . 

. . . ( let,, _in A/i . . . /„ . (Z„ (/) , Zj (/),..., Z„ (/) ) ) 

Notation 3 (Rewrite relations in Xi etrec ) On A/ e?rec -terms we consider the following rewrite relations: 
j6 -reduction denoted by ->-p; generalised j8 -reduction denoted by -+ g p ; r\ -reduction denoted by -^j 
letrec-unfolding denoted by -> v . 

For each of these rewrite relations the many -step rewrite relation with respect to ->■ will be written 
as and the (strongly convergent) infinite rewrite relation as -»►. 

With the transformation from Section [TJ which is further developed in the next sections, we aim 
to convert a given term t with repetitive reduction patterns into a term t' that does not require these 
reductions to be performed any more, but such that t and t' are 'operationally equivalent', in a sense that 
guarantees that important properties observable during evaluation are preserved. 
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One candidate for a precisely defined notion of operational equivalence is the extension to A/ e , rec -terms 
of 'applicative bisimulation' on A -terms due to Abramsky [1]. Two Xi etrec -terms M and N are called ap- 
plicative bisimilar (symbolically: M ~ B AO if M and N behave in the same way under all possible series 
Eo,E\,E2,. ■ ■ of 'experiments' of the following kind: on a starting term Mq the first experiment Eq con- 
sists in finding out whether or not Mo reduces to an abstraction (a weak head normal form); if the outcome 
Mi of the previous experiment £7 is indeed an abstraction Xx.Ni, then for experiment Ei+\ an arbitrary 
term P;+i is chosen, and it is determined whether or not the redex {Xx.Ni)Pj + \ reduces to an abstraction. 

While applicative bisimulation has been frequently used to justify optimising transformations for 
functional programming languages, there may be a host of other interesting notions of operational equiv- 
alence. Since, for the moment, we do not want to commit ourselves to a particular notion of operational 
equivalence, we will use a syntactically defined notion of equivalence between terms instead. In fact, 
we will define this syntactic notion as the convertibility relation with respect to rewrite relations that we 
use for motivating and justifying the optimising transformation in, for example, Fig. [T] and Fig [2] There, 
we use, in addition to infinite convertibility with respect to -* v and -*- g R, a restricted form of 'vector 
T] -reduction' that is defined by the following rewrite rule: 

Xx\ ■ ■ .x n .Mxi ■ ■ -x n -*■ M (if x\, ... ,x n distinct, and not free in M) 

The induced rewrite relation extends 17 -reduction, but can be mimicked with r\ -steps, and therefore 
has the same many-step relation. However, neither for r\ -reduction nor for vector r\ -reduction it holds in 
generality that the source and the target of a step are applicative bisimilar: for example, in the 77 -reduction 
step Xx.yx y the source is an abstraction, but the target is not. 

Since we want to obtain a syntactically defined notion of operational equivalence that is stronger 
than applicative bisimilarity, we define a restriction ->^ of and a variant -*^p«- of -*ffoi both of 
which serve our purposes and, importantly, only allow steps between applicative bisimilar terms. The 
converse rewrite relation <^ of ->^ performs a copying operation for A -abstraction prefixes in terms; 
and the converse rewrite relation <-fjp«* of -^p cr both copies a A-abstraction prefix and carries out a 
permutation in it. 

Definition 4 (Restriction and variant of -*^) The restricted version ->^ of the rewrite relation on 
A/ efrec -terms is defined by the rule: 

Xx\ . . .x n .{Xx\ . . .x n .M)x\ . . .x n -*■ Xx\...x n .M (if x\, ... ,x n distinct, and not free in M) 

And the extension -^p« of with respect to permuting variable names in abstraction prefixes is 
defined by the rewrite rule: 

Xx\ . . .x n \Xx n (i) . . .x n ( n yM)x n (i) . . .x n ( n ) Xxi...x„.M 

(if x\ , . . . ,x n distinct, and not free in M, 
and 71 is a permutation on {1, ... ,n}) 

It is easy to verify that left- and right-hand sides of these rules are applicative bisimilar. 

Now we define the syntactic notion of equivalence on which we base our transformation. 

Definition 5 (Equivalence relation =™ g g) Infinite convertibility with respect to -* v and -*- g p, extended 
by finitely many -*rjP ci -reduction steps, is the following relation on A; efrec -terms: 

OO / \ X 
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Since source and targets of each of the rewrite relations -> v , -+ g p, and -^p** are applicative bisimilar, 
and since applicative bisimilarity is a contextual congruence [T], the following proposition can be proved, 
which states that the syntactical equivalence from Definition[5]is at least as strong as, i.e. is contained in, 
applicative bisimulation. 

Proposition 6 For all X tetrec -terms M,N it holds: M =™ => M ~ B N. 

3 Further Examples 

The repeat function shows that for some cases it is possible to lift parameters out of recursive positions 
and thereby improve run-time efficiency. That raises the question of when this is possible. What is the 
pattern that allows for such an optimisation? 

What strikes the eye are the occurrences of the syntactic element repeat x on both the left-hand and 
the right-hand sides of the function definition. That suggests a sort of common subexpression elimination 
that takes into account both sides of the equation. This formulation, however, cannot cover the following 
example, which differs from repeat essentially only by an additional parameter n on the left-hand side, 
and the argument n - 1 on the right. 

replicate x — [] 

replicate n x = x : replicate (n — 1) x 

As it was the case for repeat, again it is possible to lift the parameter x out of the recursion. 

replicate n x = let rec = [ J 

rec n = x : rec (n — 1) 
in rec n 

Again we regard both variants and their infinite unfolding (Fig. [2]) to understand the transformation. For 
reasons of clarity, however, we completely leave out the scrutinisation of parameter n and the subsequent 
case discrimination and concentrate on the recursive pattern. 

In comparison with the previous example we observe two differences. First, note the 'header' 
Xn.Xx. [ ] n x attached on top of the second graph, which we obtain by two T] -expansions. This al- 
lows us to produce an optimised term that does not comprise a duplicated function body. Furthermore, 
instead of relying on ordinary j3 -reduction we have to add generalised beta-reductions (g(3 -reduction) 
iTTTi to our arsenal. 

The key pattern shared by the two presented examples that permits optimisation is a parameter p 
that is being passed through unchanged in the recursive application. Consequently, once the function is 
called from the outside with some argument a in p's position, while recursively evaluating that call, p 
can never again be bound to another value than a or a descendant of a. In that sense one might call p a 
'constant parameter'. It suggests itself, that components that are 'constant' to a recursive construct can 
be lifted out of the recursion. 

4 A rewrite rule for simple recursive patterns 

The examples in Section [T] and Section [3] suggest that there are many similar situations in which optimi- 
sations of the kind as described can be carried out. A first attempt to obtain general formulations of such 
simplification steps would be to use schemata, and in effect, rewrite rules on Xi etrec -terms. 
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Figure 2: Operational equivalence of the two replicate variants depicted in graph notation 
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As an example, let us consider the recursive definition of a function / in which the (n + 1 )th parameter 
y is passed on to all recursive calls of / as the (n+ l)th argument. In this case the transformation that 
eliminates the recurrent parameter y can be described by the following first-order rewrite rule on ki etrec 

let / = A24. ... Xx n .Xy.C [f h ... t n y] in/ 
— * 

Xxi. ... Xx n .Xy.\et f = Xx l . ... Xx n .C [/' h ... t n ] in /' x l ... x n 

where C is a A; e?rec -context with possibly more than one occurrence of the context-hole [], / and /' 
do not occur in C, and y does not get bound during hole-filling. Here the recurrent parameter y in the 
recursive definition of / is lifted out of the recursion, and the number of arguments in the recursive call 
of the function in the let-construct has decreased by one after the transformation. Note, that context C 
might start off with initial lambdas, and therefore also covers the case in which y is followed by further 
parameters. 

To cover situations with multiple recursive calls to / with (possibly) varying arguments, we can 
generalise the rewrite rule as follows, as long as the argument in question, y, is the same. 

let / = Axi. ... Xx n .Xy.C [f t{ ... t l n y, ...,/ £f ... C y\ m / 
— > 

X Xl . ... Xx n .Xy.let /' = A./,. ... Xx n .C [/' /,' ... #,...,/' t[ n ... C] in/' .r, ... x n 

C is a context with m sort of holes [] 1 , . . . , [] m , in which holes of each sort may occur more than once, 
where / and /' do not occur in C, and the parameter y does not get bound during hole-filling. 

In order to enhance its application to a bigger class of A/^ rec -terms, the second rule can be further gen- 
eralised to cover situations in which / is only one amongst many functions defined in a letrec-construct, 
or there are also other recursive calls to / that are not of the 'good' form. Namely, if a definition like: 

/ = Xx v ... Xx n .Xy.C [f t\ ... tl „ / /,'" ... c y] 

occurs somewhere in a let-binding, then it can be substituted by: 

/ = A.r . ... Azn.Ay.let /' = Xx ± . ... Xx n .C [/' t\ ... t\ // /' i? ... t™ y] in/' Xl ... x n 

There might be calls of / in C that are of 'bad' shape. These remain unchanged. On A/ efrec -terms, this 
more general transformation can be expressed by the rewrite rule: 

let E [f — X Xl . ... Xx n .Xy.C [f t{ ... t\ y,...,f t™ ... t™ y\\ in M 

let / = Xxi. ... Xx n .Xy. 

let E [/' = Xxl ... Xx n .C [/' tl 
in M 

where E is a context of the form D\,.. . , A-i, [], A, • • • ,A consisting of definitions D\, D,-_i, D i+ i, 
Di and a single hole [ ] . 



•••/,; // /' /r-/,"'// in /'./:..../; 
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4.1 Rewriting the Haskell Prelude 

To demonstrate the above rewriting rule, we apply it to straightforward implementations of some well- 
known functions from the Haskell Prelude. 

map [ — [ 

map f (x : xs) — fx: map f xs 

until p f x = if p x then x else until p f (/ x) 

(-H-) [] ys = ys 

(-H-) (x : xs) ys = x : xs -H- ys 

For the optimised counterparts the amount of j3 -reduction steps saved per recursive call amounts to 
one for map and (++), and to two for until. 

map f = let rec [\ — [ | 

rec (x : xs) —fx: rec xs 
in rec 

until p f x = let rec x = if p x then x else rec (/ x) 
in rec x 

(-H-) xs = let rec [ = 

rec (x : xs) = x : rec xs 
in rec xs 

In practise, however, the amount of /3 -reductions is only one of many factors for the run time that is 
necessary to evaluate a piece of code. Therefore it is to be expected that when executed with a system 
like the Glasgow Haskell Compiler (GHC) the obtained functions would not necessarily lead to better 
performance. Depending on the compiler version and the flags provided in the invocation of GHC, simple 
benchmarks yield mixed results, but never resulting in severe degradation (more than an increase of 5% 
in run time) and with one of the functions (until) consistently reaching a speed-up of over 200%. 

Let us conclude that before integrating the transformation into a compiler for practical purposes an 
analysis on how it interacts with other optimisations remains yet to be done. 

4.2 Limitations of the Rewrite Rules 

While the most general rewrite rule above has proven to be applicable in a number of situations that 
occur in practice, it also has some severe limitations: First and foremost it applies only to patterns with 
immediate recursion, thus it fails to capture the repetitive reduction pattern in the evaluation of the term 
in Fig. [3] Iff and g are not used at further positions, once in the recursion initiated by / a both x and 
y will never be bound to a different value than a. The property leading to this behaviour is the relation 
between the parameters of / and g. In /, if g is called, then x is passed as an argument and thereby bound 
to y. Conversely, y is bound to x in the call of / in g. Thus, we observe a relation between parameters that 
is cyclic. Also for all of the previous example such a parameter cycle exists, however comprising only a 
single parameter. In the following section we shall elaborate further on the idea of parameter cycles. 
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let / x — 

g y = 

in... / a .. 



fy- 



g x ... 



Figure 3: Schematic term involving mutual recursion 



5 Binding Analysis 

In this section we shall develop an analysis that statically recognises repetitive reduction patterns indi- 
cated by parameter cycles, and show how this allows us to eliminate those binders that are part of such 
a cycle. Parameter cycles describe the possibility of a parameter being passed on from function to func- 
tion unchanged, finally arriving at its original position. To detect such cycles we need to analyse which 
subterm might be bound to which variable during the evaluation of a term. 

5.1 Binding Graph 

To this end we introduce a binding relation o — c y x T on variables V and subterms T of a term. It is 
a conservative approximation of which bindings might occur during the evaluation of the term and does 
not distinguish between different descendants of its components. 

Since we need to take a global vantage point, i.e. to uniquely identify the term's syntactic elements 
we assume globally unique variable naming. To this end we could also use positional information, but 
only along with additional technicalities. Therefore, without loss of generality we henceforth assume that 
each abstraction binds a distinct variable (variables in the term are 'distinctly bound'), and no variable 
name has both a free and a bound occurrence (the term observes 'Barendregt's Variable Convention' [|3] 
2.1.13, p. 26]). That allows us to unmistakably address a specific binding Xx by the variable x it bindsQ 

For example for some context C in the reduction of term t = C[(Xx.e\) e?\ the /3-redex {Xx.e{) e2 
might be contracted and by consequence a descendant of ei be bound to an instance of x. Therefore t 
implies xo — e 2 . This principle carries over to gj8 -reduction, and more importantly, to sharing. 

In order to obtain the binding relation for a term t one has to identify all terms in argument position 
that might ever match up with Xx in its reduction. (More precisely: terms whose descendants might ever 
match up with descendants of Xx.) This could be naively accomplished by a search that starts at each 
abstraction Xx and from there travels upwards the spine segment of Xx. When an applicator with a as 
an argument is encountered that matches Xx according to the semantics of g/3 -reduction xo — a is noted 
down. When a multiplexer is encountered the search is pursued for each of the incoming edges. 

The use of higher-order functions makes it impossible to enumerate the binding relation completely. 
Thus, if during the search the end of the spine is reached one cannot make a safe assumptions on 'future' 
arguments for that branch. Therefore we employ an additional 'artificial' blackhole node • that represents 
an unknown term on the right hand side of the binding relation. According to this, the occurrence of 
e\ (Xx.ei) implies xo — •. Note, that the blackhole node is not needed to identify parameter cycles, but 
is however required for the domination property introduced later on. 

Let us revisit the examples presented so far and enlist their binding relation. In the A -graph of repeat 
(Fig. [I]) the search starting from the only abstraction Xx branches at the multiplexer above. The left 

'Even if that forbids distinguishing between different occurrences of the term x, it turns out not to be necessary for our 
needs. 
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branch yields a matching application with x in argument position, so we obtain x o — x. At right branch 
the spine immediately ends yielding xo — •. The directed graph that is induced by this relation features 
a single-node parameter cycle. 





n-\ 



Figure 4: Binding graphs of repeat, replicate, and the term from Fig. 3 



The binding-graph of replicate is very similar featuring the same kind of parameter cycle, only does it 
involve an additional parameter. Also we have to apply the idea of g/3 -reduction when searching upwards 
from Xx. First we encounter another abstraction Xn, which according to the notion of g/3-reduction 
requires us to leave out the next application node. In the left branch of the multiplexer this is the one 
with n - 1 in argument position. Therefore we end up with x as the argument of the matching application 
node, by which we obtain x o — x. Parameter cycles of greater length would occur for scenarios involving 
mutual recursion such as the term in Fig. [3] 

5.2 Inference Rules 

To properly define the binding relation we formulate it using inference rules. Since it follows the structure 
of typing rules, we first give rules for a simply-typed A/ efrec -calculus (Fig. [5]) and then decorate these 
typing rules to also yield a term's binding graph, by which we hope to provide an easier access for those 
who are already familiar with typing rules for the A -calculus. Type variables are denoted by Greek letters. 



Var 
X : T 6 r 

r^x-.z 



Abs 



ru {x ■ z} => e ■ a 
r => Xx.e ■ z -> o 



Letrec 



Vi € {0, ... ,n} : ru {/i : Ti ,...,/„ : Z n } =» e,- : g 
T => letrec f\ = e\ .../„ = e n in <? : T 



App 

r=>«?i :T- 



e 2 -p 



■ e\ e 2 ■ <J 



Figure 5: Typing rules for the simply-typed A/ e;rec -calculus 

In order to infer the binding relation (Fig. [6]>, at any application e\ e^ one has to determine to which 
variable ^2 might be bound to in e\ . That we accomplish by annotating the types of terms with the names 
of the variables that are bound by their abstractions. The annotations are in superscript position, so for 
some annotated types x - z, y - a, ej, ■ p, the annotated type of e\ - Xx.Xy.es is ei : (t ->■ ((T -> p) y ) x . We 
use £ as an annotation to indicate that we cannot determine the associated variable (due to the use of 
higher-order functions). 
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The binding relation is denoted on the left-hand side of the turnstile symbol. Both upper-case and 
lower-case Greek letters denote annotated type variables, but the latter do not include the outermost 
annotation (which are then added explicitly). 

Var Abs 

x-.TeF 8hru{jc:T £ }=>e:I 



Letrec 

Vl e {0,.. ., n} I- ru{/i:7i, ...,/„ :?;}=>£,•: 7} 
Bqu . . .uB n \-r => letrec f\ = e\ .../„ = e n in eo ■ To 

App 

B\ \- F => ei : (T E) x B 2 \-T^e 2 :R bare (T) = bare (R) 



B\ uB2ubind(x,e2) ublackholes(R) h T => e\ ei'-lL 
blackholes(T) = 



'blackholes(R) u{xc^,} if T = (L^R) X 
blackholes(R) if T = (L tf) e 

otherwise 



bind(x,e2) = 



if x = £ 

{x o — e 2 } otherwise 



Figure 6: Inductive definition of the binding relation based on annotated types 

In the Abs rule the abstraction puts a new variable x, which is annotated by an e because we do not 
make assumptions on the variables exposed by higher-order functions. The type of Xx.e is annotated by 
x, which is exposed. To compare types App employs a function bare :A^T, which maps annotated 
types A to bare types T by removing all annotations. Two further functions, bind and blackholes, are 
used to enumerate the elements to be added to the binding relation. In case the argument e 2 is a higher- 
order function the expressions cannot be determined that are bound to the variables it exposes, therefore 
blackholes binds a blackhole to each of them. 

Proposition 7 Every derivation T> in the type system in Fig. ^with conclusion => M : X, which justifies 
the assignment of type T to the Xi etrec -term M, can be decorated effectively with as result a derivation 
T> in the proof system in Fig. ^with conclusion B I- => M ■ f, by which the binding graph B for M is 
obtained and justified. 

The proof of this proposition consists in describing an effective algorithm that, given a derivation V 
in the type system in Fig. [5] proceeds as follows: First it constructs, in a step by step manner, variable 
decorations for the types in V (by concentrating on 'spine loops' of M that correspond to cyclic threads 
in V, and starting with the decorations at formulas on such threads where the types have minimal length) 
in such a way that the rule instances are correct for the system in Fig. [6] when the leading binding 
graph annotations of the form B h are neglected. Second, after the first step is concluded, the algorithm 
constructs the binding graph annotations according to the rules in Fig.[6]in a top-down manner. 
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6 Transformation 

Using the binding graph we will now develop a more general version of the optimisation previously 
formulated as rewriting rules restricted to directly recursive functions. 

An edge xo — e in the binding graph of a term t indicates a (possibly infinite) class of g/3 -redexes 
on the infinite unfolding T of t. While all these redexes could be at once contracted on T this does not 
automatically carry over to its finite representation fj^Let us for the present restrict our attention to cases 
where x has no further incoming edges. 

Proposition 8 Let t be a Xi etrec -term with infinite unfolding T and a binding graph featuring a node x 
with a single incoming edge x o — e. If we label the transition that contracts all g/3-redexes with involving 
a Ax-abstraction red, and the substitution of all occurrences of t by e and the subsequent elimination of 
all vacuous Ax-abstractions trans, then the following diagram commutes. 

t T 
trans I j red 

t' — v r 

This follows from the following considerations. Since we assume unique variable naming, the infinitely 
unfolding t without any renaming is semantics preserving. [8{^Every occurrence of Xv in t gives rise to 
one ore more g/3 -redexes in T each having Xv with p as an argument. This follows from xo — e being 
the sole incoming edge of x. By both paths from t to T' one finds T to have the same shape: There are 
no occurrences of neither Xx nor x. Evidently this not a very strong argument and we hope to improve 
on it by means of higher-order reasoning. 

The restriction above to only consider nodes with a single predecessor prevents us from dealing with 
any of the examples shown so far since they involve cyclic binding graphs. Fortunately, it can be relaxed 
to a much less restrictive property. 

Definition 9 (Domination, and strong domination) Let a G = (V, >-*■} be a directed graph, and u and v 
be vertices of G. We say that v dominates w (v is a dominator for w, symbolically: domc(v,w)) if either 
v = w or v + w and for every path % in G that leads to w but does not contain v it holds that the start vertex 
of 7C is reachable from v; more formally, if Q 

v = wv(v^wa Vmo, . .. ,u n € V n {v}[wo >* U\ »...>* u n = w ==> v >->*«o]) (1) 

Note that v »*w holds if v dominates w. 

And we say that v strongly dominates w (v is a strong dominator for w, symbolically: sdom(}(v,w)) 
if v + w and for every path n in G that leads to w but does not contain v it holds that the start vertex uq of 
% is reachable from v, but does not reside on a common cycle with v, more formally, if: 

V t W A VmOi . . . ,U n 6 V\ {v}[wo >+Ul>+ ...>+U n =W => V >+*UQ A Uq >f*v] (2) 

2 In fact, the reduct of an unfolded Afc^-term does not have to be expressible as a finite term in Xi etrec in general. 
Even if this has only been proven for /i-unfolding but it is assumed that it also holds for X) etrec . The unfolding does not 
preserve unique naming. 

4 The condition [I] could be simplified by taking it to be just the subformula starting with the universal quantification (that 
subformula is true if v = w)\ the longer condition is used here to increase readability. 



Rochel, Grabmayer 



97 



Remark 10 The standard definition of a 'v dominates w' for control-flow graphs (see e.g. ||9!j) requires 
that each path from the start node to w has to pass through v. The definition above is a generalisation 
to directed graphs that does not depend on the existence of a designated start node. Our definition of 
'v strongly dominates w' excludes self-domination (i.e. makes the relation irreflexive), and adds the 
restriction that for all paths from v to w that do not repeatedly pass through v it holds no vertex except 
the starting vertex v resides on a common cycle with v. 

The following proposition suggests an alternative, co-recursive definition of strong domination be- 
tween vertices, which proceeds stepwisely by examining predecessors of the strongly dominated vertex. 

Proposition 11 Let G = (V, >+} be a directed graph. Then for all v, w e V it holds: 

sdomc(v,w) <^^> v + w A v >* + w a w u>+w A u±v=> sdomo(v, u) ) 

For the definition of the optimising transformation, we choose a formulation different from the one 
using rewrite rules described in Section |4] one that is particularly easy to express. Such as j8 -reduction 
can be decomposed into substitution of individual occurrences of variables (local j8 -reduction) and the 
elimination of vacuous bindings (AT-removal) as described in ||4]], for the transformation we have the 
possibility of expressing the transformation with an arbitrary level of granularity. The following rule 
encompasses the substitution of all occurrences of one dominated variable. It could have been formu- 
lated more fine-grained by substituting only individual occurrences, or less fine-grained by including the 
elimination of the bindings that have become vacuous. 

Proposition 12 (Main proposition) In a ki etrec -term t occurrences of a variable that is dominated by an 
expression d in t 's binding graph can be substituted by d. 

B^0=>t:L B=>doniB(d,x) 

7 Advanced Examples 

There are interesting examples to which the presented transformation cannot be directly applied or does 
not lead to satisfactory results. Only in combination with a number well-directed unfoldings the desired 
effect can be obtained. Let us consider two schematic examples: 

Xy. ... let rec = Xx.C [rec x] in ... rec ... 
let / = Xx.Xy.D [f x bj a y] in .../ ... 

The first one exhibits dominated parameters only after a single let-unfolding (Fig. [7]). Also it features 
repetitive reduction pattern, without having a parameter cycle. This because the argument in question is 
bound outside of the recursion. Still, after the unfolding the presented optimisation does cure the term. 

The second of the two term is particularly intricate since it can be unfolded and then transformed 
in many different ways with considerable differences in the amount of concealed g/3-redexes. Most 
solutions involve more than one recursive call with more than two arguments. The ideal transformation 
with respect to the number of concealed redexes of both terms is depicted in Fig. [8] 



98 



Repetitive Reduction Patterns in Lambda Calculus with letrec 




8 Status quo 

Presently this work consists mostly of an extended problem description, which is to a some extent rather 
informal. Ideas for resolution have been presented, but yet lack both precision and genericity. Therefore, 
outstanding issues to continue the investigation suggest themselves. Currently we are working on the 
following problems: 

• Properly formalising the concepts introduced in this report. This involves putting grammars and 
rewrite rules into the higher-order setting of HRSs. 

• Investigate whether there are other interesting notions of 'operational equivalence', apart from 
applicative bisimulation, with respect to which we could show correctness of our optimising trans- 
formation. (We think of notions that are in line with the semantics of functional programming 
languages, but nevertheless are language independent, and also of theoretical interest.) 

• Expressing the presented transformation as a higher-order rewrite system and proving its correct- 
ness with respect to that equivalence relation. 

Once these fundamental issues are resolved, we intend to tackle the following questions: 

• We have seen that unfolding a Xi etrec -term in order to facilitate the optimisation can be effected in 
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x 



Figure 8: Optimisation of two terms that are not covered by the presented rewriting rules 

different ways, leading to terms of different quality. To obtain the most efficient terms one has to 
provide a procedure to select the most suitable unfolding for each situation. 

• This requires an adequate measure for efficiency. 

• We would like to learn more about the rewrite properties of the transformation: is it possible to 
find a formulation that guarantees confluence and normalisation? 

• In some of the easy examples we studied, our transformation seemed to be closely connected 
with the concept of 'lambda-dropping' 015]], and hence also with its converse, 'lambda-lifting' 
ifTOl [T2l l6l . We want to understand that relationship in detail. 

• Once the analysis has been optimised in terms of genericity, i.e. that it recognises as many cases 
as possible for which the transformation is correct, it would be interesting to assess the frequency 
in which these cases occur in existing systems, such as functional programming libraries or inter- 
mediate code generated by compilers. 
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• The remaining question is, how the optimisation actually affects the run-time efficiency of real- 
world systems like Haskell programs. 

Acknowledgment. The incentive to investigate the presented optimisation was provided by Doaitse 
Swierstra. We thank him and Vincent van Oostrom for many insightful discussions and hints. 
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